Learning to Skim Text

Authors

  • Adams Wei Yu
  • Hongrae Lee
  • Quoc V. Le
Abstract

Background. In recent years, deep networks have been applied with much success to many important applications in Natural Language Processing: sentiment analysis, document classification, machine translation, conversational/dialogue modeling, and automatic Q&A. An important trait of all of these models is that they read all the text available to them. While this is essential for certain applications, such as machine translation, it also makes these models difficult to apply to tasks with long input text, such as document classification or automatic Q&A.

Aim. We consider the problem of long document understanding and propose a modification to the basic neural architectures that allows them to read input text non-sequentially. The main benefit of this approach is faster inference, because the model skips irrelevant information. An unexpected benefit of this approach is that it sometimes helps the models generalize better.

Data. The tasks under test include number prediction (synthetic data), sentiment classification (Rotten Tomatoes and IMDB), news topic classification (AG) and reading comprehension (Children’s Book Test). These are representative text reading tasks involving datasets of different sizes and various levels of text processing.

Methods. In our approach, the model is a recurrent network that learns to predict the number of jumping steps after it reads an input token. The model is therefore not fully differentiable, but it can be trained with a policy gradient algorithm called REINFORCE. The reward for the recurrent network is designed to optimize the accuracy of the model on the training dataset.

Anticipated results. The comparison is between the vanilla LSTM and our model. In a nutshell, we anticipate that, while achieving the same test accuracy, our model is much faster than the baseline LSTM model, because it is able to skip portions of the text.

Conclusions. The model we develop can indeed learn how to “jump” while processing text, which makes it faster than most of the existing methods.
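The read-then-jump loop and the REINFORCE update described under Methods can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`skim`, `reinforce_grad`), the fixed `jump_logits` (a real model would compute them from the recurrent hidden state), and the reward of 1.0 standing in for a correct prediction are all assumptions made for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def skim(tokens, jump_logits, rng):
    """Read one token, sample how far to move forward, repeat.

    jump_logits[k] scores moving forward by k + 1 tokens (hypothetical
    parameterization). Returns visited positions and sampled jump indices.
    """
    probs = softmax(jump_logits)
    pos, visited, actions = 0, [], []
    while pos < len(tokens):
        visited.append(pos)  # the token at `pos` is read
        k = rng.choices(range(len(probs)), weights=probs)[0]
        actions.append(k)
        pos += k + 1  # k tokens are skipped before the next read
    return visited, actions


def reinforce_grad(jump_logits, actions, reward, baseline=0.0):
    """Score-function (REINFORCE) gradient estimate w.r.t. the logits.

    For a softmax policy, d log p(k) / d logit_j = 1[j == k] - p(j).
    """
    probs = softmax(jump_logits)
    grad = [0.0] * len(jump_logits)
    for k in actions:
        for j in range(len(grad)):
            indicator = 1.0 if j == k else 0.0
            grad[j] += (reward - baseline) * (indicator - probs[j])
    return grad


# Usage: skim a short sentence with a uniform policy over jumps of 1 to 3.
rng = random.Random(0)
tokens = "the film was long but the ending was great".split()
logits = [0.0, 0.0, 0.0]
visited, actions = skim(tokens, logits, rng)
grad = reinforce_grad(logits, actions, reward=1.0)  # assumed reward
```

Because the jump is sampled rather than computed by a differentiable operation, the gradient is estimated from the log-probability of the sampled actions weighted by the reward, which is why a policy gradient method is needed at all.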

Similar resources

The Quality of The UHT Skim Milk as Affected by Addition of Rennet Skim Milk

Background and Objectives: Consumption of whole dairy products has declined due to the awareness of possible harmful effects of fat on consumers’ health. The purpose of the present paper was to investigate the possibility of substituting the Ultra-high temperature processing (UHT) whole milk with partially hydrolyzed κ-casein to manufacture a UHT skim milk. Materials and Methods: UHT skim milk ...

The Effect of Text Color and Background Color on Skim Reading Webpages in Thai

There are many sets of guidelines concerning the accessibility of the web for older adults, but little empirical evidence from studies with older people to support their recommendations. In addition, all the recommendations apply to text in languages using the Latin alphabet. This study investigated the effects of text color and background color on the performance and preferences of younger and...

A Model for Extracting Information from Text Documents, Based on Text Mining in the E-Learning Domain

As computer networks become the backbones of science and economy, enormous quantities of documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text mining has become an important research area that discovers unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...

Text skimming: the process and effectiveness of foraging through text under time pressure.

Is skim reading effective? How do readers allocate their attention selectively? The authors report 3 experiments that use expository texts and allow readers only enough time to read half of each document. Experiment 1 found that, relative to reading half the text, skimming improved memory for important ideas from a text but did not improve memory of less important details or of inferences made ...

The Interaction of Gender with Text Enhancement and Meta-cognitive Grammar Instruction on Learning and Recall of English Grammar

The current research was an effort to study the interaction of gender with text enhancement and meta-cognitive grammar instruction on learning and recall of English grammar. To this end, two groups of students consisting of 51 learners from both genders were formed. The participants were 51 male and 51 female learners. The 51 participants of each gender were further divided into two groups. The...

Publication date: 2017